Search CORE

54 research outputs found

Predicting gene expression in T cell differentiation from histone modifications and transcription factor binding affinities by linear mixture models

Author: A Dempster
AD Goldberg
B Georgi
BM Turner
D Smedley
DJC MacKay
DR Barreda
E Woolf
EV Rothenberg
Francisco de AT de Carvalho
G Wei
GE Hinton
GJ McLachlan
H Ji
H Zou
Helge G Roider
HG Roider
HG Roider
HJ Bussemaker
HJ Bussemaker
IG Costa
IG Costa
Ivan G Costa
J Zhu
JR Schoenborn
L Breiman
M Bibikova
MJ Brusco
P Matthias
R Karlic
S Keles
T Kouzarides
Thais G do Rego
V Matys
W DeSarbo
W Huber
X Mo
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Statistical Modeling of Transcription Factor Binding Affinities Predicts Regulatory Interactions

Author: B Ren
C Dieterich
C Tuerk
D Galas
D Johnson
DC King
F Gao
G Courtois
Helge G. Roider
HG Roider
M Fried
Martin Vingron
Michael Levitt
O Berg
P von Hippel
R Staden
S Coles
S Rahmann
SE Girardin
T Hisamatsu
Thomas Manke
TI Lee
TJP Hubbard
V Matys
W Feller
WE Johnson
Z Bar-Joseph
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Recent experimental and theoretical efforts have highlighted the fact that binding of transcription factors to DNA can be more accurately described by continuous measures of their binding affinities, rather than a discrete description in terms of binding sites. While the binding affinities can be predicted from a physical model, it is often desirable to know the distribution of binding affinities for specific sequence backgrounds. In this paper, we present a statistical approach to derive the exact distribution for sequence models with fixed GC content. We demonstrate that the affinity distribution of almost all known transcription factors can be effectively parametrized by a class of generalized extreme value distributions. Moreover, this parameterization also describes the affinity distribution for sequence backgrounds with variable GC content, such as human promoter sequences. Our approach is applicable to arbitrary sequences and all transcription factors with known binding preferences that can be described in terms of a motif matrix. The statistical treatment also provides a proper framework to directly compare transcription factors with very different affinity distributions. This is illustrated by our analysis of human promoters with known binding sites, for many of which we could identify the known regulators as those with the highest affinity. The combination of physical model and statistical normalization provides a quantitative measure which ranks transcription factors for a given sequence, and which can be compared directly with large-scale binding data. Its successful application to human promoter sequences serves as an encouraging example of how the method can be applied to other sequences

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

MPG.PuRe

The Influence of Transcription Factor Competition on the Relationship between Occupancy and Affinity

Author: A Marcovitz
Boris Adryan
CC Fowlkes
D Chu
DT Gillespie
DT Gillespie
DT Gillespie
E Segal
Frances M. Sladek
GD Stormo
GD Stormo
GD Stormo
GK Ackers
H Flyvbjerg
HG Roider
J Elf
J Zeitlinger
JS van Zon
L Bintu
L Bintu
L Mirny
M Djordjevic
M Hedglin
M Kampmann
M Riley
M Santillan
MD Biggin
N Rosenfeld
Nicolae Radu Zabet
NR Zabet
NR Zabet
NR Zabet
NR Zabet
OG Berg
OG Berg
P Hammar
PH von Hippel
R Hermsen
Robert Foy
S Thomas
SJ Maerkl
T Kaplan
T Raveh-Sadka
T Wasson
U Gerland
Y Zhao
Z Wunderlich
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 27/03/2013
Field of study

Transcription factors (TFs) are proteins that bind to specific sites on the DNA and regulate gene activity. Identifying where TF molecules bind and how much time they spend on their target sites is key to understanding transcriptional regulation. It is usually assumed that the free energy of binding of a TF to the DNA (the affinity of the site) is highly correlated to the amount of time the TF remains bound (the occupancy of the site). However, knowing the binding energy is not sufficient to infer actual binding site occupancy. This mismatch between the occupancy predicted by the affinity and the observed occupancy may be caused by various factors, such as TF abundance, competition between TFs or the arrangement of the sites on the DNA. We investigated the relationship between the affinity of a TF for a set of binding sites and their occupancy. In particular, we considered the case of the transcription factor lac repressor (lacI) in E.coli, and performed stochastic simulations of the TF dynamics on the DNA for various combinations of lacI abundance and competing TFs that contribute to macromolecular crowding. We also investigated the relationship of site occupancy and the information content of position weight matrices (PWMs) used to represent binding sites. Our results showed that for medium and high affinity sites, TF competition does not play a significant role for genomic occupancy except in cases when the abundance of the TF is significantly increased, or when the PWM displays relatively low information content. Nevertheless, for medium and low affinity sites, an increase in TF abundance (for both cognate and non-cognate molecules) leads to an increase in occupancy at several sites. © 2013 Zabet et al

University of Essex Research Repository

arXiv.org e-Print Archive

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Queen Mary Research Online

FigShare

Inferring Binding Energies from Selected Binding Sites

Author: A Sarai
AE Kel
C Tuerk
Christopher Workman
DA Gilchrist
David Granas
DS Fields
DSF Homsi
E Roulet
E Sharon
Gary D. Stormo
GD Stormo
GD Stormo
GD Stormo
GD Stormo
H Ji
HF Teh
HG Roider
J Linnell
J Liu
JB Kinney
JJ Moré
L van Oeffelen
M Djordjevic
M Djordjevic
MF Berger
ML Lee
MQ Zhang
O Berg
PH von Hippel
PV Benos
PV Benos
Q Zhou
R Staden
SJ Maerkl
TH Cormen
TK Blackwell
TK Man
U Gerland
V Mustonen
VH Nagaraj
WE Wright
X Liu
X Meng
Y Takeda
Yue Zhao
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

We employ a biophysical model that accounts for the non-linear relationship between binding energy and the statistics of selected binding sites. The model includes the chemical potential of the transcription factor, non-specific binding affinity of the protein for DNA, as well as sequence-specific parameters that may include non-independent contributions of bases to the interaction. We obtain maximum likelihood estimates for all of the parameters and compare the results to standard probabilistic methods of parameter estimation. On simulated data, where the true energy model is known and samples are generated with a variety of parameter values, we show that our method returns much more accurate estimates of the true parameters and much better predictions of the selected binding site distributions. We also introduce a new high-throughput SELEX (HT-SELEX) procedure to determine the binding specificity of a transcription factor in which the initial randomized library and the selected sites are sequenced with next generation methods that return hundreds of thousands of sites. We show that after a single round of selection our method can estimate binding parameters that give very good fits to the selected site distributions, much better than standard motif identification algorithms

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Digital Commons@Becker

Functional Characterization of Transcription Factor Motifs Using Cross-species Comparison across Large Evolutionary Distances

Author: A Louvi
A Rogulja-Ortmann
A Stark
AC Edwards
AE Kel
AI Su
AS Adler
AV Morozov
B van Steensel
BD McCabe
BP Berman
BP Berman
Brian James
C Kwong
C Rushlow
C Rushlow
C van Waveren
CM Bergman
CT Harbison
CW Whitfield
CW Whitfield
D Karolchik
D Porcelli
D Vlieghe
DE Newburger
E Kurant
E Segal
EJ Ward
Evgeny M. Zdobnov
F Casares
GD Stormo
Gene E. Robinson
HG Roider
HG Roider
HM Berman
Hugh M. Robertson
J DeZazzo
J Pinnell
J Wang
J Zeitlinger
J Zhu
JA Lynch
Jaebum Kim
JB Warner
JD Gibson
JD Storey
JG Gindhart Jr
JH Werren
John H. Werren
Joshua D. Gibson
JR Desjarlais
JZ Parrish
KM Bhat
LA Pennacchio
LD Ward
LF Sempere
LW Chang
M Ashburner
M Blanchette
M Boden
M Delorenzi
M Kanehisa
M Kellis
MA Crosby
MB Noyes
MB Noyes
MC Frith
MC Frith
ME Fortini
MS Halfon
N Rajewsky
Oliver Niehuis
P Kheradpour
PK Sorger
R Garesse
R Gordân
RC Scarpulla
RD Finn
Ryan Cunningham
S Grossmann
S Robin
S Roy
S Sinha
S Sinha
S Sinha
SA Ramsey
Saurabh Sinha
Stefan Wyder
T Berleth
TE Creighton
TL Bailey
U Keich
V Matys
W Huang da
WC Xiong
WJ Nelson
WW Wasserman
Wyeth W. Wasserman
X Xie
X Zhou
XY Li
Y Haraguchi
Z Huang
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

We address the problem of finding statistically significant associations between cis-regulatory motifs and functional gene sets, in order to understand the biological roles of transcription factors. We develop a computational framework for this task, whose features include a new statistical score for motif scanning, the use of different scores for predicting targets of different motifs, and new ways to deal with redundancies among significant motif–function associations. This framework is applied to the recently sequenced genome of the jewel wasp, Nasonia vitripennis, making use of the existing knowledge of motifs and gene annotations in another insect genome, that of the fruitfly. The framework uses cross-species comparison to improve the specificity of its predictions, and does so without relying upon non-coding sequence alignment. It is therefore well suited for comparative genomics across large evolutionary divergences, where existing alignment-based methods are not applicable. We also apply the framework to find motifs associated with socially regulated gene sets in the honeybee, Apis mellifera, using comparisons with Nasonia, a solitary species, to identify honeybee-specific associations

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Archive ouverte UNIGE

RNAcontext: A New Method for Learning the Sequence and Structure Binding Preferences of RNA-Binding Proteins

Author: AP Gerber
BC Foat
BC Foat
BC Foat
C Shin
CT Workman
D Ray
DE Tsai
Debashish Ray
DJ Hogan
E Segal
EM Conlon
Esther T. Chan
FB Gao
FC Oberstrass
G Badis
HG Roider
Hilal Kazan
I Perez
IL Silanes
J Hackermuller
J Ule
JA Granek
JD Keene
JD Keene
JR Sanford
KE Lukong
L Wickham
M Blanchette
M Hiller
M Rabani
MF Berger
MJ Law
ML Bulyk
MT Miller
NC Meisner
P Benos
Quaid Morris
R Tacke
RH Byrd
RJ Buckanovich
S Griffiths-Jones
S Sinha
Saurabh Sinha
SR Eddy
SR Eddy
T Aviv
Timothy R. Hughes
TL Bailey
TM Bailey
X Chen
X Li
X Wang
Y Ding
Y Wang
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Metazoan genomes encode hundreds of RNA-binding proteins (RBPs). These proteins regulate post-transcriptional gene expression and have critical roles in numerous cellular processes including mRNA splicing, export, stability and translation. Despite their ubiquity and importance, the binding preferences for most RBPs are not well characterized. In vitro and in vivo studies, using affinity selection-based approaches, have successfully identified RNA sequence associated with specific RBPs; however, it is difficult to infer RBP sequence and structural preferences without specifically designed motif finding methods. In this study, we introduce a new motif-finding method, RNAcontext, designed to elucidate RBP-specific sequence and structural preferences with greater accuracy than existing approaches. We evaluated RNAcontext on recently published in vitro and in vivo RNA affinity selected data and demonstrate that RNAcontext identifies known binding preferences for several control proteins including HuR, PTB, and Vts1p and predicts new RNA structure preferences for SF2/ASF, RBM4, FUSIP1 and SLM2. The predicted preferences for SF2/ASF are consistent with its recently reported in vivo binding sites. RNAcontext is an accurate and efficient motif finding method ideally suited for using large-scale RNA-binding affinity datasets to determine the relative binding preferences of RBPs for a wide range of RNA sequences and structures

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

BayesPI - a new model to study protein-DNA interactions: a case study of condition-specific protein binding parameters for Yeast transcription factors

Author: A Delaunay
A Tanay
A Yarragudi
AE Tsong
AR Borneman
B Alberts
BC Foat
BE Bernstein
C Moorman
CK Lee
CT Harbison
CY Chen
D Das
D Das
D Mackay
DC Raitt
DS Fields
E Aurell
E Wingender
F Gao
F Ozsolak
G Tuteja
GL Bond
HG Roider
HJ Bussemaker
HK Tsai
I Nabney
J Deckert
J Lee
J Wang
J Wang
J Wang
J Zeitlinger
JB Kinney
JM Bland
JM Cherry
Junbai Wang
K Murphy
KD MacIsaac
L Jen-Jacobson
L Narlikar
L Segal
M Djordjevic
MJ Buck
ML Bulyk
Morigen
MP Ryan
O Sertil
OG Berg
PV Benos
Q Zhou
RD Kornberg
RF Lascaris
RH Morse
S Ghaemmaghami
S Keles
SF Gull
TE Cheatham 3rd
TK Man
TN Mavrich
U Gerland
U Gerland
VB Zhurkin
W Gorner
W Lee
WK Olson
X Liu
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background We have incorporated Bayesian model regularization with biophysical modeling of protein-DNA interactions, and of genome-wide nucleosome positioning to study protein-DNA interactions, using a high-throughput dataset. The newly developed method (BayesPI) includes the estimation of a transcription factor (TF) binding energy matrices, the computation of binding affinity of a TF target site and the corresponding chemical potential. Results The method was successfully tested on synthetic ChIP-chip datasets, real yeast ChIP-chip experiments. Subsequently, it was used to estimate condition-specific and species-specific protein-DNA interaction for several yeast TFs. Conclusion The results revealed that the modification of the protein binding parameters and the variation of the individual nucleotide affinity in either recognition or flanking sequences occurred under different stresses and in different species. The findings suggest that such modifications may be adaptive and play roles in the formation of the environment-specific binding patterns of yeast TFs and in the divergence of TF binding sites across the related yeast species.</p

Helsebibliotekets Research Archive

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A computational evaluation of over-representation of regulatory motifs in the promoter regions of differentially expressed genes

BACKGROUND: Observed co-expression of a group of genes is frequently attributed to co-regulation by shared transcription factors. This assumption has led to the hypothesis that promoters of co-expressed genes should share common regulatory motifs, which forms the basis for numerous computational tools that search for these motifs. While frequently explored for yeast, the validity of the underlying hypothesis has not been assessed systematically in mammals. This demonstrates the need for a systematic and quantitative evaluation to what degree co-expressed genes share over-represented motifs for mammals. RESULTS: We identified 33 experiments for human and mouse in the ArrayExpress Database where transcription factors were manipulated and which exhibited a significant number of differentially expressed genes. We checked for over-representation of transcription factor binding sites in up- or down-regulated genes using the over-representation analysis tool oPOSSUM. In 25 out of 33 experiments, this procedure identified the binding matrices of the affected transcription factors. We also carried out de novo prediction of regulatory motifs shared by differentially expressed genes. Again, the detected motifs shared significant similarity with the matrices of the affected transcription factors. CONCLUSIONS: Our results support the claim that functional regulatory motifs are over-represented in sets of differentially expressed genes and that they can be detected with computational methods

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

Identification of Novel Pax8 Targets in FRTL-5 Thyroid Cells by Gene Silencing and Expression Microarray Analysis

Author: A Kikuchi
A Mansouri
AD Bradshaw
Anna Conti
AR Marques
B D'Andrea
B Zhang
C Brisken
C Provenzano
C Puppin
C Wu
CB Cui
D Civitareale
D Plachov
DA Whyte
E Torban
ES Tien
FS Ambesi-Impiombato
G Cali
G Damante
HG Roider
J Braganca
JH Hong
K Stark
KS Park
Lucio Nitsch
M De Felice
M Murakami
M Murakami
M Narlis
M Ohno
M Pasca di Magliano
M Zannini
Mahoney WM Jr
Marian Ludgate
Mariastella Zannini
MC D'Adamo
NJ Bowen
P Zacchi
PE Macchia
QY Lei
S Bhattacharya
S Draghici
S Vainio
SC Mok
Serena Scala
T de Cristofaro
T Di Palma
T Di Palma
T Di Palma
T Di Palma
T Endo
T Endo
T Miyakoshi
TF Lane
Tina Di Palma
Tiziana de Cristofaro
U Tepass
WK Chia
Y Nishida
YJ Chen
YR Gonzalez
Z Hegedus
ZZ Wu
Publication venue: Public Library of Science
Publication date: 23/09/2011
Field of study

The differentiation program of thyroid follicular cells (TFCs), by far the most abundant cell population of the thyroid gland, relies on the interplay between sequence-specific transcription factors and transcriptional coregulators with the basal transcriptional machinery of the cell. However, the molecular mechanisms leading to the fully differentiated thyrocyte are still the object of intense study. The transcription factor Pax8, a member of the Paired-box gene family, has been demonstrated to be a critical regulator required for proper development and differentiation of thyroid follicular cells. Despite being Pax8 well-characterized with respect to its role in regulating genes involved in thyroid differentiation, genomics approaches aiming at the identification of additional Pax8 targets are lacking and the biological pathways controlled by this transcription factor are largely unknown.To identify unique downstream targets of Pax8, we investigated the genome-wide effect of Pax8 silencing comparing the transcriptome of silenced versus normal differentiated FRTL-5 thyroid cells. In total, 2815 genes were found modulated 72 h after Pax8 RNAi, induced or repressed. Genes previously reported to be regulated by Pax8 in FRTL-5 cells were confirmed. In addition, novel targets genes involved in functional processes such as DNA replication, anion transport, kinase activity, apoptosis and cellular processes were newly identified. Transcriptome analysis highlighted that Pax8 is a key molecule for thyroid morphogenesis and differentiation.This is the first large-scale study aimed at the identification of new genes regulated by Pax8, a master regulator of thyroid development and differentiation. The biological pathways and target genes controlled by Pax8 will have considerable importance to understand thyroid disease progression as well as to set up novel therapeutic strategies

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central